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Amendments To The Claims: 

This listing of claims will replace all prior versions and listings of claims in the 
application: 

Claims 1-111. (cancelled) 

Claim 112. (previously presented) A method comprising: 

(a) providing first data from a first set of samples wherein: 

(i) the first set of samples comprises a plurality of samples classified 
into a first biological state class and a plurality of samples classified into a 
second biological state class; 

(ii) the data from the first set of samples comprises a plurality of data 
elements, each data element characterized by a value, wherein all of the 
samples share a plurality of common data elements; 

(b) performing multivariate analysis on the first data to qualify each common 
data element in the first data based on the ability of the data element to classify a 
sample into the first biological state class or the second biological state class, 
wherein data element values are qualified using a classification model; 

(c) selecting a first subset of qualified common data elements from the first 
data; 

(d) providing second data from a second set of samples wherein: 

(i) the second set of samples comprises a plurality of samples 
classified into the first biological state class and a plurality of samples 
classified into the second biological state class; 

(ii) the data from the second set of samples comprises a plurality of 
data elements, each data element characterized by a value, wherein all of 
the samples share the plurality of common data elements; 

(iii) the first set of samples and second set of samples come from first 
and second populations that have a statistically significant difference with 
respect to at least one preanalytical variable; 
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(e) performing multivariate analysis on the second data to qualify each 
common data element in the second data based on the ability of the data 
element to classify a sample into the first biological state class or the second 
biological state class, wherein data element values are qualified using a 
classification model; 

(f) selecting a second subset of qualified common data elements from the 
second data; 

(g) selecting an intersection subset of data elements from the first and 
second subsets, wherein each data element in the intersection subset is a 
member of both of the first and second subsets; and 

(h) displaying the intersection subset on a graphical display interface on a 
user device. 

Claim 113. (previously presented) The method of claim 112 wherein the first 
and second populations have a statistically significant difference with respect to a 
preanalytical variable selected from the group consisting of gender, age, ethnicity, 
sample collection parameter, sample processing parameter, weight, diet, medication 
status, medical condition, amount of physical exercise, pregnancy, level of circulating 
antibodies and a clinical characteristic. 

Claim 114. (previously presented) The method of claim 113 wherein the first 
and second populations have a statistically significant difference with respect to a 
plurality of preanalytical variables selected from said group. 

Claims 115. (previously presented) The method of claim 1 1 2 wherein the first 
samples and the second samples are collected from different geographical locations. 

Claims 116. (previously presented) The method of claim 1 1 2 wherein the first 
samples and the second samples are collected from different clinical trial sites. 
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Claim 117. (previously presented) The method of claim 112 wherein the step 
of selecting the first and second subsets comprises using the discovery data sets to 
train a learning algorithm wherein the learning algorithm ranks the data elements based 
on a quantitative measure of ability to classify. 

Claim 1 18. (previously presented) The method of claim 1 1 7 wherein 

the learning algorithm is a supervised learning algorithm. 

Claim 119. (Canceled) 

Claim 1 20. (previously presented) The method of claim 1 1 7 wherein 

the training comprises using support vector machine analysis. 

Claim 121. (Canceled) 

Claim 122. (Canceled) 

Claim 123. (previously presented) The method of claim 112 further 

comprising independently re-sampling data elements in each data set. 

Claim 1 24. (previously presented) The method of claim 1 1 2 further 

comprising, selecting candidate biomarkers from selected data elements and testing 
one or more of the candidate biomarkers on a validation data set. 

Claim 125. (Canceled) 

Claim 126. (Canceled) 

Claim 1 27. (previously presented) The method of claim 1 1 2 wherein 

the biological state class is selected from the group consisting of: presence of a 
disease; absence of a disease; progression of a disease; risk for a disease; stage of 
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disease; likelihood of recurrence of disease; a genotype; a phenotype; exposure to an 
agent or condition; a demographic characteristic; resistance to agent, sensitivity to an 
agent, and combinations thereof. 

Claim 128. (Canceled) 

Claim 129. (Canceled) 

Claim 130. (Canceled) 

Claim 1 31 . (previously presented) The method of claim 1 24 wherein 

the one or more candidate biomarkers are diagnostic of the presence of a disease, risk 
of developing a disease, risk of recurrence of a disease, or stage of the disease. 

Claim 1 32. (previously presented) The method of claim 1 1 2 wherein 

values of the data elements in a data point represent levels and/or frequency of 
components in a data point sample. 

Claim 133. (previously presented) The method of claim 132 wherein 

components are selected from the group consisting of: nucleic acids, proteins, 
polypeptides, peptides, carbohydrates and modified or processed forms thereof. 

Claim 1 34. (previously presented) The method of claim 1 1 2 wherein 

levels of components are measured by an expression profiling assay. 

Claim 135. (Canceled) 

Claim 136. (Canceled) 
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Claim 137. (previously presented) The method of claim 134 

wherein the expression profiling assay comprises measuring the amount and/or form of 
a protein, polypeptide or peptide. 

Claim 138. (previously presented) The method of claim 137 wherein 

the expression profiling assay comprises mass spectrometry. 

Claim 139. (previously presented) The method of claim 138 wherein 

the expression profiling assay comprises SELDI analysis. 

Claim 140. (Canceled) 

Claim 141. (previously presented) The method of claim 112 

wherein expression profiling comprises: 

(a) contacting samples with a substrate comprising binding partners for 
specifically binding to sample components having selected characteristics 
and 

(b) identifying sample components bound to the substrate. 

Claim 142. (previously presented) The method of claim 141 

wherein binding partners are selected from the group consisting of cationic molecules; 
anionic molecules; metal chelates; antibodies; single- or double-stranded nucleic acids; 
proteins, peptides, amino acids; carbohydrates; lipopolysaccharides; sugar amino acid 
hybrids; molecules from phage display libraries; biotin; avidin; streptavidin; and 
combinations thereof. 

Claim 143. (previously presented) The method of claim 141 

wherein the binding partners are arrayed on the substrate. 



Claim 144. (previously presented) The method of claim 117 wherein 

an assay used to measure levels of data elements in training data sets from which 

BOS2 780496.1 g 



Z. Zhang et al. 
U.S.S.N. 10/635,241 
Page 7 

candidate biomarkers are identified is different from an assay used to measure data 
elements in a validation data set used to validate the candidate biomarker. 

Claim 145. (previously presented) The method of claim 140 

wherein the assay used to measure levels of data elements in training data sets is 
SELDI. 

Claim 146. (Canceled) 

Claim 147. (previously presented) The method of claim 112 

wherein the independent discovery data sets are collected from different locations, 
using different collection protocols, and/or are collected from different populations. 

Claim 148. (previously presented) The method of claim 112 

wherein each discovery data set is from a different clinical trial site. 

Claim 149. (currently amended) A computer program product e mbod ie d 
i n a wr i tt e n, ele ctron i c, magn e t i c or opt i ca l phys i ca l m e d i a, wh e r e th e comput e r 
program product i s executed with a computer, comprising: 

(a) receiving input data from an input device of at least first and second 
independent discovery data sets wherein: 

(i) the data sets comprise a plurality of forms of biological state 
classes; 

(ii) each data set comprises a plurality of data points, wherein 
each data point exhibits one form of a biological state class 
and each data set comprises a plurality of data points 
belonging to each of the classes; and 

(iii) each data point comprises a plurality of data elements, each 
data element characterized by a value, wherein all data points 
share a plurality of common data elements; 
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(b) a second computer readable program providing instructions for 
qualifying each common data element, independently for each data set, based 
on the ability of the data element to classify a data point into a biological state 
class, wherein the ability of the data element to classify a data point into a 
biological state class is a function of data element value and for selecting an 
initial subset of data elements within each data set, and 

(c) a third computer readable program providing instructions for 
selecting an intersection subset of data elements from the initial subsets, 
wherein each data element in the intersection subset is a member of a majority 
of the initial subsets, wherein inputted data is displayed on a graphical display 
interface on a user device connected to a computer. 

Claim 150. (previously presented) The computer program product 

of claim 149 wherein selecting the initial subsets comprises using the discovery data 
sets to train a learning algorithm wherein the learning algorithm ranks the data elements 
based on a quantitative measure of ability to classify. 

Claim 151 . (previously presented) The computer program product 

of claim 149 wherein the learning algorithm is a supervised learning algorithm. 

Claim 152. (previously presented) The computer program product 

of claim 149 wherein the learning algorithm is an unsupervised learning algorithm. 

Claim 153. (previously presented) The computer program product 

of claim 1 50 wherein training comprises support vector machine analysis. 

Claim 154. (Canceled) 

Claim 155. (Canceled) 



Claim 156. (Canceled) 
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Claim 157. (previously presented) The computer program product 

of claim 149 further comprising program code for independently re-sampling data 
elements in each data set. 



Claim 158. (previously presented) The computer program product 

of claim 149 further comprising program code for selecting candidate biomarkers based 
on ranking by the learning algorithm and for testing one or more of the candidate 
biomarkers on a validation data set. 



Claim 159. (Canceled) 
Claim 160. (Canceled) 



Claim 161 . (previously presented) The computer program product 

of claim 149 wherein the biological state class is selected from the group consisting of: 
presence of a disease; absence of a disease; progression of a disease; risk for a 
disease; stage of disease; likelihood of recurrence of disease; a genotype; a phenotype; 
exposure to an agent or condition; a demographic characteristic; resistance to agent, 
sensitivity to an agent, and combinations thereof. 



Claim 162. (Canceled) 
Claim 163. (Canceled) 
Claim 164. (Canceled) 



Claim 165. (previously presented) The computer program product 

of claim 1 58 wherein the one or more candidate biomarkers are diagnostic of the 
presence of a disease, risk of developing a disease, risk of recurrence of a disease, or 
stage of the disease. 
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Claim 166. (previously presented) The computer program product 

of claim 161 wherein values of the data elements in a data point represent levels and/or 
frequency of components in a data point sample. 

Claim 167. (previously presented) The computer program product 

of claim 161 wherein components are selected from the group consisting of: nucleic 
acids, proteins, polypeptides, peptides, carbohydrates and modified or processed forms 
thereof. 

Claim 168. (previously presented) The computer program product 

of claim 160 wherein levels of components are measured by an expression profiling 
assay. 

Claim 169. (Canceled) 
Claim 170. (Canceled) 

Claim 171 . (previously presented) The computer program product 

of claim 168 wherein the expression profiling assay comprises measuring the amount 
and/or form of a protein, polypeptide or peptide. 

Claim 172. (previously presented) The computer program product 

of claim 168 wherein the expression profiling assay comprises mass spectrometry. 

Claim 173. (previously presented) The computer program product 

of claim 168 wherein the expression profiling assay comprises SELDI analysis. 

Claim 174. (Canceled) 
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Claim 175. (previously presented) The computer program product 

of claim 168 wherein expression profiling comprises: 

(a) contacting samples with a substrate comprising binding partners for 
specifically binding to sample components having selected characteristics; 
and 

(b) identifying sample components bound to the substrate. 

Claim 176. (previously presented) The computer program product 

of claim 175 wherein binding partners are selected from the group consisting of cationic 
molecules; anionic molecules; metal chelates; antibodies; single- or double-stranded 
nucleic acids; proteins, peptides, amino acids; carbohydrates; lipopolysaccharides; 
sugar amino acid hybrids; molecules from phage display libraries; biotin; avidin; 
streptavidin; and combinations thereof. 

Claim 177. (previously presented) The computer program product 

of claim 149 wherein an assay used to measure levels of data elements in training data 
sets from which candidate biomarkers are identified is different from an assay used to 
measure data elements in a validation data set used to validate the candidate 
biomarker. 

Claim 178. (previously presented) The computer program product 

of claim 149 wherein the assay used to measure levels of data elements in training data 
sets is SELDI. 

Claim 179. (Canceled) 

Claim 180. (previously presented) The computer program product 

of claim 149 wherein the independent discovery data sets are collected from different 
locations, using different collection protocols, and/or are collected from different 
populations. 
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Claim 181 . (previously presented) The computer program product 

of claim 149 wherein each discovery data set is from a different clinical trial site. 

Claim 182. (previously presented) A system comprising: 
one or more processors for 

(a) receiving input data comprising at least first and second independent 
discovery data sets wherein: 

(i) the first set of samples comprises a plurality of samples 
classified into a first biological state class and a plurality of samples 
classified into a second biological state class; 

(ii) the data from the first sample set comprises a plurality of 
data elements, each data element characterized by a value, wherein all of 
the samples share a plurality of common data elements; 

(iii) the second set of samples comprises a plurality of samples 
classified into the first biological state class and a plurality of samples 
classified into the second biological state class; 

(iv) the data from the second sample set comprises a plurality of 
data elements, each data element characterized by a value, wherein all of 
the samples share the plurality of common data elements; 

(b) executing computer readable program code providing instructions for 
qualifying each common data element, independently for each data set, based 
on the ability of the data element to classify a data point into a biological state 
class, wherein data element values are qualified using a classification and for 
selecting an initial subset of data elements within each data set; and 

(c) executing computer readable program code providing instructions for 
selecting an intersection subset of data elements from the initial subsets, 
wherein each data element in the intersection subset is a member of a majority 
of the initial subsets and, wherein the results of the programs are displayed on a 
graphical display. 
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Claim 183. (previously presented) The system of claim 182 further 

comprising one or more devices for providing input data to the one or more processors. 

Claim 1 84. (previously presented) The system of claim 1 82 wherein 

the one or more devices for providing input data comprises a detector for detecting a 
characteristic of a data element. 



Claim 185. (previously presented) 
the detector comprises a mass spectrometer. 

Claim 186. (previously presented) 
the detector comprises a gene chip reader. 

Claim 187. (previously presented) 
comprising a memory for storing a data set of 



The system of claim 182 wherein 



The system of claim 1 82 wherein 



The system of claim 1 82 further 
ranked data elements. 



Claim 188. (previously presented) The system of claim 182 further 

comprising a database of ranked data elements. 

Claim 189. (previously presented) The system of claim 182 wherein 

selecting the initial subsets comprises using the discovery data sets to train a learning 
algorithm wherein the learning algorithm ranks the data elements based on a 
quantitative measure of ability to classify. 

Claim 190. (previously presented) The system of claim 189 wherein 

the learning algorithm is a supervised learning algorithm. 

Claim 191. (Canceled) 



BOS2 780496.1 



13 



Z. Zhang et al. 
U.S.S.N. 10/635,241 
Page 14 

Claim 192. (previously presented) The system of claim 189 wherein 

training comprises support vector machine analysis. 

Claim 193. (Canceled) 

Claim 194. (Canceled) 

Claim 195. (Canceled) 

Claim 196. (previously presented) The system of claim 182 wherein 

the system further executes program code for independently re-sampling data elements 
in each data set. 

Claim 197. (previously presented) The system of claim 189 wherein 

the system further executes program code for selecting candidate biomarkers based on 
ranking by the learning algorithm and for testing one or more of the candidate 
biomarkers on a validation data set. 

Claim 198. (Canceled) 

Claim 199. (Canceled) 

Claim 200. (previously presented) The system of claim 1 82 wherein 

the biological state class is selected from the group consisting of: presence of a 
disease; absence of a disease; progression of a disease; risk for a disease; stage of 
disease; likelihood of recurrence of disease; a genotype; a phenotype; exposure to an 
agent or condition; a demographic characteristic; resistance to agent, sensitivity to an 
agent, and combinations thereof. 

Claim 201 . (Canceled) 
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Claim 202. (Canceled) 
Claim 203. (Canceled) 

Claim 204. (previously presented) The system of claim 197 wherein 

the one or more candidate biomarkers are diagnostic of the presence of a disease, risk 
of developing a disease, risk of recurrence of a disease, or stage of the disease. 

Claim 205. (previously presented) The system of claim 182 wherein 

values of the data elements in a data point represent levels and/or frequency of 
components in a data point sample. 

Claim 206. (previously presented) The system of claim 205 wherein 

components are selected from the group consisting of: nucleic acids, proteins, 
polypeptides, peptides, carbohydrates and modified or processed forms thereof. 

Claim 207. (previously presented) The system of claim 205 wherein 

levels of components are measured by an expression profiling assay. 

Claim 208. (Canceled) 

Claim 209. (Canceled) 

Claim 21 0. (previously presented) The system of claim 207 wherein 

the expression profiling assay comprises measuring the amount and/or form of a 
protein, polypeptide or peptide. 

Claim 21 1 . (previously presented) The system of claim 207 wherein 

the expression profiling assay comprises mass spectrometry. 
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Claim 212. (previously presented) The system of claim 214 wherein 

the expression profiling assay comprises SELDI analysis. 

Claim 213. (Canceled) 

Claim 214. (previously presented) The system of claim 207 wherein 

expression profiling comprises: 

(a) contacting samples with a substrate comprising binding partners for 
specifically binding to sample components having selected characteristics 
and 

(b) identifying sample components bound to the substrate. 

Claim 21 5. (previously presented) The system of claim 214 wherein 

binding partners are selected from the group consisting of cationic molecules; anionic 
molecules; metal chelates; antibodies; single- or double-stranded nucleic acids; 
proteins, peptides, amino acids; carbohydrates; lipopolysaccharides; sugar amino acid 
hybrids; molecules from phage display libraries; biotin; avidin; streptavidin; and 
combinations thereof. 

Claim 216. (previously presented) The system of claim 1 82 wherein 

an assay used to measure levels of data elements in training data sets from which 
candidate biomarkers are identified is different from an assay used to measure data 
elements in a validation data set used to validate the candidate biomarker. 

Claim 21 7. (previously presented) The system of claim 216 wherein 

the assay used to measure levels of data elements in training data sets is SELDI. 

Claim 218. (Canceled) 
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Claim 219. (previously presented) The system of claim 182 wherein 

the independent discovery data sets are collected from different locations, using 
different collection protocols, and/or are collected from different populations. 

Claim 220. (previously presented) The system of claim 182 wherein 

each discovery data set is from a different clinical trial site. 

Claim 221 . (previously presented) The method of claim 1 1 2 wherein the 
multivariate analysis on the first data comprises use of a pattern recognition process. 

Claim 222. (previously presented) The method of claim 221 wherein the 
pattern recognition process comprises a classification model. 

Claim 223. (previously presented) The method of claim 112 wherein the 
multivariate analysis on the second data comprises use of a pattern recognition 
process. 

Claims 224. (previously presented) The method of claim 223 wherein the 
pattern recognition process comprises a classification model. 

Claim 225. (previously presented) The computer program product of claim 
149, wherein the computer is a digital computer. 

Claim 226 (previously presented) The computer program product of claim 
149, wherein the computer is further connected to a server. 
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